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Abstract 

Based  upon  modern  statistical  theory  and  econometric  methods,  this 
study  investigates  the  impacts  of  skewness  and  kurtosis  on  risk  estima- 
tion and  determination.   It  was  found  that  both  skewness  and  kurtosis  of 
security  rates  of  return  are  important  in  capital  asset  pricing  determina- 
tion. 


I .   Introduction 

It  has  been  known  for  some  tine  that  the  returns  of  security  and 
the  residuals  from  market  model  regressions  are  not  exactly  normally 
distributed.  Mandelbrot  (1963)  first  demonstrated  that  stock  return 
did  not  follow  the  normal  distribution.   Mandelbrot  (1963,  67)  and 
Fielitz  (1971)  showed  that  the  variation  in  stock  prices  is  not  sta- 
tionary.  Fama  (1965,  76)  and  Mandelbrot  suggested  a  stable  class 
distribution  as  the  probability  model  for  the  distribution  of  stock 
prices.   Fama  estimated  the  characteristic  exponents  for  the  stable 
Paretian  distribution  to  be  somewhere  from  1.7  to  1.9  in  contrast  to 
a  value  of  2.0  for  the  normal  distribution  and  1.0  for  the  Cauchy  dis- 
tribution.  Blume  (1968)  also  provided  empirical  evidence  that  a 
characteristic  exponent  between  1.7  and  1.8  summarized  the  residuals 
from  market  model  regressions  rather  well. 

On  the  other  side,  Officer  (1972)  found  that  the  standard  devia- 
tion of  the  distribution  of  daily  and  monthly  stock  returns  is  a  well 
behaved  dispersion  measure.   This  finding  is  inconsistent  with  the 
findings  of  stable  Paretian  distribution  by  Fama  and  others,  since  the 
standard  deviation  is  expected  to  be  large  and  behave  erratically  if  the 
stock  prices  follow  a  stable  class  distribution.   Officer  also  provided 
evidence  that  some  of  the  properties  of  the  stock  return  distribution 
are  not  consistent  with  the  stable  hypothesis.   Specifically,  the 
tine-series  sums  of  daily  stock  returns  became  "thinner-tailed"  for 
larger  sums,  and  the  cross-sectional  sums  of  monthly  stock  residuals 
from  the  market  model  had  a  characteristic  exponent  larger  than  that  of 
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component  stocks.   Hsu,  Miller  and  Wichern  (1974)  chose  the  minimum 
chi-square  goodness-of-f it  procedure  to  test  the  stable  hypothesis. 
Their  results  for  four  companies  showed  that  symmetric  stable  dis- 
tributions are  not  consistent  with  the  daily  data  although  the  stable 
Paretian  model  fits  better  on  the  monthly  rate  of  returns.   Based  upon 
their  findings,  Hsu,  Miller  and  Wichern  (74)  proposed  a  normal  proba- 
bility model  with  a  nonstationary  variance  subject  to  step  changes  at 
irregular  time  points  corresponding  to  shifts  of  various  exogenous 
factors . 

The  determination  of  stock,  return  distribution  is  important  for 
the  risk  estimation.   Fisher  (1932)  showed  that  the  dispersion  of 
sample  variance  is  a  function  of  the  fourth  moment.   This  implies  the 
measured  total  risk  of  stocks  will  be  affected  by  the  value  of  kurtosis 
which  appear  to  be  positive  much  often  than  negative. 

The  nonstationarity  of  stock  returns  probability  distributions 
and  distribution  parameters  have  led  to  the  instability  of  beta  sys- 
tematic risk  estimates.   Francis  (1979)  showed  that  the  stability  of 
beta  coefficient  estimated  for  each  individual  security  can  be  affected 
by  three  components:   (1)  correlation  coefficient  between  the  indi- 
vidual security  and  the  market  portfolio;  (2)  the  standard  deviation 
of  individual  security  returns;  and  (3)  the  standard  deviation  of  mar- 
ket returns.   In  a  three  parameter  capital  asset  pricing  model,  Kraus 
and  Litzenberger  (1976)  and  Lee  (1976)  showed  that  both  beta  systematic 
risk  and  gamma  systematic  skewness  estimated  from  the  quadratic  char- 
acteristic lines  are  partly  determined  by  variance,  skewness,  kurtosis 
and  the  investment  horizon  of  the  market  returns.   Therefore,  to  resolve 
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the  problem  cf  risk  estimation  and  understand  the  short  run  and  long  run 
stability  of  risk  measures,  we  need  to  closely  examine  the  behavior  of 
skewness  and  kurtosis. 

In  the  estimation  of  beta  systematic  risk,  the  least  squares  esti- 
mator will  be  best  linearly  unbiased  only  if  classical  assumptions  on 
linear  model  are  satisfied.   One  of  these  assumptions  is  the  normality 
of  the  observations.   If  this  assumption  is  violated  due  to,  for 
example,  the  existence  of  skewness  and  high  kurtosis,  then  the  least 
regression  (LS)  can  no  longer  provide  the  best  linear  unbiased  esti- 
mator.  Under  this  condition,  other  robust  estimators  may  be  superior 
to  the  LS  estimator,  unless  either  the  original  data  are  transformed 
to  satisfy  the  classical  assumptions  of  the  linear  model,  or  the  skew- 
ness and  kurtosis  are  not  serious. 

It  is  well-known  that  the  mean-absolute-deviation  (MAD)  method  has 
been  suggested  as  an  alternative  to  the  least  squares  regression  in  es- 
timating the  regression  parameters  for  a  fat-tailed  distribution.   The 
appeal  of  MAD  apparently  comes  from  the  fact  that  it  gives  relatively 
less  weights  to  outliers  as  compared  to  LS  regression.   Theoretically 
the  MAD  estimator  should  be  more  efficient  than  LS  estimator  when  ap- 
plied to  the  fat-tailed  distributions.   However,  the  empirical  reports 
on  the  performance  of  MAD  estimator  varied  widely.   Meyer  and  Glauber 
(1964),  Winginton  (1968)  found  that  MAD  regression  outperformed  or  at 
least  provided  no  worse  forecasting  power  than  the  LS  regression. 
Ruppert  and  Carroll  (1980)  reported  that  MAD  will  be  more  efficient 
only  if  the  kurtosis  is  very  high.   On  the  other  hand,  Wise  (1963) 
argued  that  neither  LS  nor  MAD  can  provide  the  best  linear  unbiased 
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estimatcr  for  a  characteristic  exponent  with  values  between  1  and  2. 
Sargent  (1969)  used  simulation  to  show  that  for  distributions  with 
characteristic  exponents  greater  than  1.7,  LS  regression  performs  bet- 
ter than  MAD  regression.   Two  recent  studies  which  are  directly  related 
to  the  systematic  risk  estimation  showed  no  sign  of  improvement  from 
MAD  regression.   Sharpe  (1971)  provided  evidence  that  two  methods  give 
similar  results.   Cornell  and  Dietrich  (1978)  found  that  the  short 
term  instability  or  beta  estimates  does  not  resulted  from  the  fat-tailed 
distribution.   Their  study  further  supported  Sharpe's  and  Sargent's 
findings . 

However,  these  comparisons  should  be  taken  more  cautiously.   When 
the  skewness  of  individual  security  return  is  significant,  the  effi- 
ciency of  MAD  regression  should  be  reduced  despite  the  existence  of 
high  kurtosis.   Furthermore,  the  sequential  parameter  variation  may 
be  due  to  various  problems  such  as  structural  changes,  misspecifica- 
tions,  aggregation,  and  error  in  variable.   If  these  probelms  are 
serious,  the  performance  of  MAD  method  become  difficult  to  evaluate 
before  problems  are  removed. 

The  purpose  of  this  study  is  to  examine  the  impacts  of  skewness 
and  kurtosis  on  the  determination  and  stability  of  risk  measures. 
The  classical  two  parameter  model  and  the  three  parameter  models 
recently  proposed  are  employed  to  estimate  different  risk  measures. 
Section  2  discusses  the  stability  of  total  risk.   The  theoretical 
relationship  between  total  risk  and  the  higher  moment  is  laid  out 
and  the  intertemporal  stability  of  total  risk  is  examined.   Section  3 
examines  the  stability  of  beta  risk  for  each  subgroup  with  similar 
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return  iistribu ticns.   Section  4  explores  che  impacts  of  skewness  and 
kurtosis  en  risk  decomposition.   Possible  impacts  of  skewness  and  kur- 
cosis  on  the  estimated  betas  are  also  investigated  in  accordance  with 
quadratic  market  model.   To  complete  the  investigation  on  the  effect 
of  distribution  parameters  on  the  risk  estimates,  Section  5  evaluates 
the  performance  of  mean-absolute-deviation  and  least  squares  estimators 
when  the  rates  of  return  are  not  normally  distributed.   Finally,  the 
results  of  this  study  are  summarized  in  Section  6. 

II.   The  Stability  of  Total  Risk 

Recent  research  in  risk  measures  has  indicated  that  the  variance 
of  security  return  is  more  stable  than  the  beta  systematic  risk. 
Francis  (1979)  showed  that  most  of  the  standard  deviations  of  individ- 
ual security  returns  are  fairly  stable  intertemporally,  and  suggested 
that  the  historical  standard  deviation  may  be  used  without  adjust- 
ments for  intertemporal  change  in  the  valuation  of  put  and  call  op- 
tions.  The  relative  stability  of  standard  deviation  also  implies 
that  Sharpe's  portfolio  performance  measure  may  be  superior  to 
Treynor's  or  Jensen's. 

The  stability  of  risk  statistics  is  an  empirical  question.   Never- 
theless, theoretically  the  higher  moment  of  returns  distributions  can 
affect  the  sample  variance.   If  security  returns  are  not  normally  dis- 
tributed, the  estimated  sample  variance  of  security  returns  is  more 
unstable  over  time  than  that  of  the  normal  distribution.   Fisher  (1932) 
showed  that  for  any  infinite  population,  the  variance  of  sample  variance 
from  random  samples  of  size  n  can  be  written  as 


4  k 

2      9CT         n  -  1   4 

(i)  vcs^-^t   d+IVri4) 

n  —  l        ^n    4 

a 

The  first  component  is  the  variance  of  sample  variance  given  that  the 
parent  distribution  is  normal.   The  second  component  represents  the 
influence  of  non-normality,   k  is  Fisher's  fourth  cumulant  defined 
as 

(2)  k4  =  E(Y.  -  E(Y.))4  -  3a4 

This  fourth  cumulant  is  zero  for  a  normal  distribution.  Notice  that 
the  stability  of  sample  variance  is  only  affected  by  the  relative  kur- 
tosis,  and  is  not  affected  by  skewness.  Also  the  effect  of  relative 
kurtosis  is  almost  independent  of  the  sample  size  and  will  remain  even 
with  large  sample.  Therefore,  using  only  the  sample  variance  as  the 
major  criterion  for  the  stability  of  risk  measures  may  be  misleading 
if  kurtosis  of  the  distribution  is  not  considered. 

The  stability  of  total  risk  is  reexamined  in  this  section.   Monthly 
returns  for  464  securities  and  the  market  portfolio  are  collected  from 
the  CRSP  tape.   The  sample  period  is  from  January  1959  to  December  1979. 
Data  are  first  grouped  into  three  nonoverlapping  periods:   1959-65, 
1966-72,  1973-79,  yielding  84  return  observations  for  each  period.   The 
monthly  treasure  bill  rate  is  used  as  a  proxy  for  the  risk  free  rate. 
Company  data  are  further  divided  into  two  groups  according  to  the  degree 
of  kurtosis  of  returns  distribution.   The  regression  model  is  specified 
as 

(3)  sj,t+l  =  a0  +al  Sj,t  +ut+l     C  =  1»2 
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The  sample  variance  of  security  in  the  current  period  is  regressed 
against  the  sample  variance  in  the  last  period.   Equation  (3)  is 
estimated  for  three  groups:   normality,  significant  kurtosis,  and 
all  (464)  companies.   Mincer  and  Zarnowitz's  (1969)  mean  square 
error  (MSE)  method  is  employed  to  decompose  the  sources  of  error. 
Results  are  displayed  in  Table  1. 

The  effect  of  kurtosis  on  the  stability  of  total  risk  is  significant. 
The  values  of  R  squares  drop  substantially  for  the  group  with  significant 
kurtosis  when  the  sample  variance  in  the  previous  period  is  used  to  esti- 
mate the  variance  in  the  current  period.   The  total  mean  square  errors  for 
the  group  with  significant  kurtosis  are  much  larger  than  those  for  the 
group  with  normality.   In  the  first  period  regression,  the  total  mean 
square  error  for  the  second  group  is  about  2.2  times  the  size  of  the 
first  group.   In  the  second  period,  this  ratio  goes  up  to  2.8.   Overall, 
the  major  estimation  errors  are  due  either  to  random  variation  or 
bias,  while  the  inefficiency  term  is  generally  small.   The  inefficiency 
term  is  slightly  larger  for  the  second  group  with  significant  kurtosis. 
Essentially,  inefficiency  measures  the  deviation  of  slope  value  from 
one.   As  it  can  be  seen,  the  slope  coefficients  of  the  second  group 
regression  are  significant  smaller  than  one.   Thus,  the  results  in 
Table  1  suggests  that  the  kurtosis  of  returns  distribution  must  be 
considered  when  the  historical  standard  deviation  is  used  to  estimate 
the  total  risk.   These  results  are  consistent  with  Cootner's  (1962) 
findings  that  relative  kurtosis  information  is  important  for  security 
analysis.   They  also  indirectly  support  Scott  and  Horvath  (1980) 


Table  1 
Summary   Statistics    for   the  Stability   of  local    Paste 


jrouD 


Slope 


R 


Mean  Sauare   Errors' 


Bias        Inefficiency      Random 


lota 


rqi 


(1)  Normality 
1959-65 
1966-72 

1966-72 
1973-79 

(2)  Significant 
kurtosis 

1959-65 
1966-72 

1966-72 
1973-79 

(3)  All   Companies 

1959-65 
1966-72 

1966-72 
1973-79 


1.167    .53    .0000058  .0000002  .0000093   .0000153 

38%        1%       61% 

.952    .56    .0000038  .0000000  .0000142   .0000181 
21%        0%        79% 


781    .39 


.846 

.917 
.921 


,45 


.45 


.53 


.0000045   .0000014   .0000281   .0000339 
13%        4%        83% 


.0000063   .0000010   .0000397 
13%        2%       85% 


.0000470 


,0000054  .0000001  .0000156   .0000211 
26%        0%        74% 

,0000044  .0000001  .0000210   .0000255 
17%        0%       83% 


*The  components  of  mean  square  error  are  reported  in  both  absolute  and 
percentage  terms. 
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:heoratical  analysis  on  the  direction  of  preference  for  moments  of 

1 
higher  order  than  the  variance. 

III.   The  Stability  of  Beta  Risk 

The  beta  risk,  is  determined  by  three  portions:   standard  devia- 
tion of  individual  security  and  market  portfolio  returns,  and  correla- 
tion coefficient  between  individual  security  and  market  portfolio 
returns.   Hence,  the  variance  (stability)  of  beta  is  affected  by  the 
variances  of  these  three  components.   Beta  therefore  is  intrinsically 
more  unstable  than  the  total  risk  of  individual  securities.   To  inves- 
tigate the  stability  of  beta,  equation  (4)  is  used  as  the  estimate  model, 
Again,  regressions  are  estimated  for  three  groups:   normality,  signifi- 
cant kurtosis  and  all  companies. 

(4)       Betaj,t+1  =b0+blBetaj}t+Vt+l      t-1.2 

Table  2  shows  the  regression  results.   The  slope  of  equation  (4) 
is  significant  less  than  one  in  every  case.   The  values  of  R  squares 
and  slope  coefficients  are  relatively  small  as  compared  to  the  results 
of  total  risk  regressions.   The  kurtosis  affects  the  stability  of 
beta  risk  indirectly  through  its  effect  on  the  total  risk.   The  re- 
sults for  the  second  group  with  significant  kurtosis  is  consistent  with 
this  line  of  reasoning.   The  R  squares  and  estimated  slopes  are  smaller, 
and  the  total  mean  square  errors  are  larger  for  the  second  group  than 
for  the  first  group. 

Note  that  the  impact  of  kurtosis  on  the  total  risk  is  more  sig- 
nificant than  on  the  systematic  risk.   Another  interesting  result  can 
be  found  in  the  structure  of  mean  square  errors.   Unlike  total  risk 
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Table  2 
Summary   Statistics    for   the   Stability  of   Systematic   Risk 


Group 


Slope 


Mean  Square  Errors 


Bias   Inefficiencv   Random 


Total 


(1)  Normality 
1959-651 
1966-72 

1966-721 
1973-79 

(2)  Significant 
kurtosis 


608  .29 


.487 


35 


.0094926 

.0190702 

.1119369 

.1404997 

7% 

13% 

80% 

.0011628 

.040  2199 

.0665661 

.1079488 

1% 

37% 

6  2% 

1959-651 
1966-72 

1966-721 
1973-79 

(3)    All   Companies 

1959-651 
1966-721 

1966-721 
1973-79 


.598  .24 


.495  .33 


,605  .27 


.489  .34 


.0091584 

.0225179 

.1539511 

.1856275 

5% 

12% 

83% 

.0046785 

.0574870 

.1118450 

.1740105 

3% 

33% 

64% 

.0093896 

.0201048 

.1251045 

.1545990 

6% 

13% 

81% 

.0018404 

.0448647 

.0781831 

.1248882 

1% 

36% 

63% 
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regression,  the  inefficiency  terra  becomes  a  major  source  of  fore- 
casting errors.   The  relatively  high  inefficiency  implies  that  beta 
estimates  are  intertemporally  more  unstable  than  the  variance  of  returns. 

IV.   The  Impacts  of  Skewness  and  Kurtosis  on  Risk  Decomposition 

In  this  section,  the  estimate  model  consistent  with  the  tradi- 
tional two  parameter  capital  asset  pricing  is  introduced,  and  the 
problem  of  functional  specification  is  investigated.   Three  versions 
of  Box-Cox  transformation  are  used  to  remove  the  skewness  and  kurtosis 
of  the  return  probability  distribution.   Following  this,  the  impacts 
of  skewness  and  kurtosis  on  the  specification  of  the  model  and  the 
distribution  of  disturbance  are  examined  more  closely.   Finally,  the 
model  developed  is  linked  to  the  three  parameter  capital  asset  pricing 
model . 

1.   Generalized  Functional  Form 

The  linear  model  based  on  the  traditional  capital  asset  pricing 
theory  may  not  be  an  appropriate  functional  form  to  explain  the  risk 
return  relationship  if  the  returns  of  securities  are  not  normally 
distributed.   Nevertheless,  there  may  exist  a  transformation  such  that 
the  transformed  observations  are  normally  distributed.   Two  methods  are 
considered  in  this  section.  The  first  method  is  the  Box-Cox  trans- 
formation, and  the  second  is  the  CES  function  approximation. 

A.   Box-Cox  Transformation 

A  special  class  of  transformation  developed  by  Box  and  Cox  (1964)  is 
useful  for  inducing  normality  on  observations  from  skewed  distributions. 
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In  empirical  econometrics,  the  Box— Cox  transformation  has  been  pri- 
marily used  as  a  device  for  generalizing  functional  form.   Following 
Fabozzi,  Francis  and  Lee  (1980)  [FFL],  a  generalized  functional  form 
of  specifying  the  market  model  can  be  written  as 

(5)       R*  =  a  +  b  r\  +  e_ 

it  mt    it 

where  X's  are  the  functional  form  parameters  for  the  dependent  and 
independent  variables.   Equation  (5)  reduces  to  the  linear  form  when  X 
is  equal  to  one;  and  becomes  the  log-linear  form  when  X  approaches 
zero.  Hence  equation  (5)  includes  both  linear  and  log-linear  forms  as 
special  cases  and  provides  a  generalized  functional  form  to  test 
whether  the  linear  model  is  appropriate  for  investigating  the  risk- 
return  relation.  There  are  at  least  two  interpretations  for  X. 
Statistically,  in  a  narrow  sense  lambda  is  the  functional  transforma- 
tion parameter  used  to  eliminate  the  skewness  and  kurtosis  of  the  depen- 
dent variable,  for  the  present  case,  the  rates  of  returns  of  individual 
securities.   In  a  broader  sense,  the  value  of  X  determines  the  correct 
specification  of  the  regression  model  which  can  either  be  linear  or 
nonlinear. 

In  the  finance  literature,  A  measures  the  ratio  of  the  true  in- 
vestment horizon  to  the  observed  horizon  in  the  equilibrium  risk  return 
relationship  as  used  by  Jensen  (1969)  and  Lee  (1976) .  When  the  value 
of  X  approaches  zero,  the  market  equilibrium  is  instantaneous,  provided 
that  the  returns  are  defined  over  inf initesimally  small  time  intervals. 
Jensen  (1969)  has  developed  instantaneous  systematic  risk  by  setting 
X  equal  to  zero. 
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To   perform  the   transformation,  FPL   suggests   chat  equation    (5)    can 
be  expressed   as 


(6)  r.      =  a'    +  br       +  e. 

it  nit  H 


where 


4- 

1 

it              \ 

r        =RA     - 
nt          mt 

1 

,    _    (a-ft)   -   1 

a    "  x — 


Eit   "   N(0'Ge) 


The  residual   analysis  method  considered  by  Box  and   Cox  is    used  to 
estimate  a  parameter  which  primarily  measures  the  degree  and  the 
direction  of  skewness.      The   value  of  this   parameter   for  each  indi- 
vidual security   is   approximately  equal   to  equation   (7). 


3>*  M  >r 

2  3       9      2 


(7) 


M^^  [(M4-3M-)    +-^  +  t-| 


"D 


2        1  2  MA        6M? 

6M2  +  i  [7(M4-3M")    +  12  -gji  +  — |] 

1  M. 


where  Mi  =  n-1   [R.   -   R. ] 1   for  i  =   2,    3,   k  ,   and  ^L    =   R. 


-14- 

Equation  (7)  defines  the  maximum  likelihood  estimate  of  a  as  a  func- 
tion of  the  first  four  moments.   The  value  of  functional  parameter  is 

X   =  1  -  a 

The  empirical  results  listed  here  are  limited  to  the  cross- 
sectional  risk  return  regressions  pioneered  by  Lintner  (1965)  and 

2 
Douglas  (1969).''   Three  types  of  transformations  were  performed:   (1) 

transformation  of  both  independent  and  dependent  variables;  (2)  trans- 
formation of  dependent  variable  only;  and  (3)  transformation  of  independent 
variable  only.   Table  3  only  reports  the  results  from  the  regression  of 
the  first  type  of  transformation.   After  transformation  most  of  the 
security  returns  pass  the  skewness  test.   Table  3  indicates  that  the 
coefficient  of  residual  variance  is  significant  in  only  one  case  in 
period  1  when  average  security  return  is  regressed  against  systematic 
and  nonsystematic  risks.   Further,  the  variance  of  residuals  remain 
positive  and  significant  only  in  period  3  where  it  is  the  only  explan- 
atory variable.   The  significance  of  residual  variance  in  period  1  is 
primarily  due  to  the  existence  of  multicollinearity.  The  effect  of 
multicollinearity  can  be  seen  more  clearly  when  the  average  security 
return  is  regressed  against  the  residual  variance  alone.   The  coefficient 
of  residual  variance  has  negative  sign  and  is  insignificant.   The 
relationship  between  the  coefficients  in  the  first  and  third  regressions 
in  the  first  period  can  be  described  as  follows 

c  =  c*  +  b*d 

where  b*  and  c*  are  the  coefficients  of  beta  and  residual  variance  in  the 
first  regression,  c  is  the  coefficient  in  the  third  regression,  and  d  is 
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Table  3 

Trans  formation  of  independent  and  dependent  variables 

Period  1 

Y.  =  .00340  +  .004283.  -  .31708cr".  R~  =  .040 

3  3  ej 

(3.763)    (4.396)    (-2.303) 


Y.  =  .00332  +  .003223.  R2  =  .029 

3  3 


(3.667)    (3.737) 


Y.  =  .00663  -  .03258a2.  R2  =  .000 

3  ej 


(12.377)   (-.263) 


Period  2 


Y.  =  -.00066  +  .002693,  -  .11808a2.         R2  =  .012 
(-.619)    (2.293)    (-.883) 


Y.  =  -.00053  +  .002033.  R2  =  .010 

3  3 


(-.498)    (2.238) 


Y.  =  .00133  +  .07539a2,  R2  =  .001 

3  ej 


(2.091)    (.724) 


Period  3 


Y.  =  -.00220  +  .001213.  +  .20626a2.         R2  =  .016 
3  J         ej 

(-1.614)    (.832)     (1.810) 


Y.  =  -.00246  +  .002633.  R2  =  .009 

J  3 


(-1.812)    (2.137) 


,2  . 


Y,  =  -.00123  +  .25729a2.  R"  =  .015 

J  ej 

(-1.735)    (2.680) 
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the  auxiliary  regression  coefficient  of  regressing  beta  against  residual 

variance.   The  value  of  d  is  equal  to  y        9  where  y        ,,  is  the 

S,S"  °   2        B,s" 
s 
simple  correlation  between  beta  and  residual  variance,  and  a     and  a  n 

s^ 

are  the  standard  deviations  of  beta  and  residual  variance  respectively. 

Using  the  values  of  these  elements,  we  found  the  value  of  d  is  66.406, 

with  y     =  .47.  Thus  c  =  -.31708  x  (.00428  *  66.406)  ~   -.033,  which  is 

6,  a 
close  to  the  coefficient  of  residual  variance  in  the  third  regression. 


B.   CES  Function  Approximation  of  CAPM 

The  risk  return  relationship  based  on  a  homogeneous  two  parameter 
preference  structure  and  an  equilibrium  market  can  be  written  as 

(8)  E(HR.)  =  d-S*)HRf  +  B*E(hV 

where  R.,  ,TR,.  and  TTR  are  rates  of  return  in  terms  of  true  investment 
h  j   H  f     H  m 

horizon,  H.   If  the  true  investment  horizon  is  unknown  than  the  fol- 
lowing relationships  hold: 

E(R  )  =  [E(NRj)]X 

E(R  )  =  [E(  R )]A 

m       Mm 

E(Rf)  =  [E(NRf)]X 

where  R.  represents  the  rate  of  returns  in  terms  of  observed  horizon, 

N  l 

H  3 

and  X  =  — .   Equation  (8)  becomes 

N 

(9)  E(R.)X  =  (1-6 *)RfA  +  s!E(Rm)A 

which  is  a  CES  type  function  of  CAPM  derived  by  Lee  (1976).   Equation 
(9)  can  be  approximated  by  equation  (10) : 
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(10)  lnE(R;)  =  (1-3*)  lnRr  +  3*  lnE(R  ) 

+  i-  AB*(1-B*)  [lnE(R  )-lnRj2 
2    j     j        m     f 

where  \  measures  the  ratio  of  true  horizon  to  observed  horizon  and 

4 
the  degree  of  skewness  and  kurtosis.   Equation  (10)  is  employed  to 

test  the  functional  form  of  risk  return  relationship,  and  to  test  the 

significance  of  skewness  and  kurtosis.   If  the  coefficient  for  the 

quadratic  market  excess  rate  of  return  is  significant,  then  the  higher 

moments,  especially  skewness  will  affect  the  risk  return  tradeoff,  and 

the  linear  model  is  subject  to  specification  bias. 

Define 

(11)  r  =  jXB*(l-6*) 

The  relationship  between  6.  and  r.  can  be  written  as 

1  3 

(12)  0.  =  B*  +  br. 

J    J     3 

where  b  is  the  auxiliary  regression  coefficient  of  regressing 

(In  E(R  )  -  ln(R,))   against  (In  E(R  )  -  ln(Rr))  and  6.  is  the 
m        r  m        r        j 

systematic  risk  estimated  by  the  linear  characteristic  line.   8.  will  be 

J 

biased  downward  or  upward  depending  upon  the  sign  of  b  and  y. 

2.      Quadratic  Risk  Premium  Market  Model 

Following  Kraus  and  Litzenberger  (1976),  quadratic  characteristic 
lines  consistent  with  the  three  moment  capital  asset  pricing  model  can 
be  expressed  as 

(13)  R   -  R.  -  cn.  +  c.  .(R  -R-J  +  c,.(R  -R  J2  +  n . 

it    ft    Oi    li  mt   ft     21  mt  mt      l 


-18- 

Ihe  systematic  risk.  3  and  systematic  skewness  Y  for  the  ith  security 
can  be  written  as 

(14)  5.  =  c,  .  +  c,.  (M3/a2) 

i    lx    2x   ra  m 

(15)  Y.  =  c.  .  +  c0.  [M4  -  a4]/!!3 

l    lx    2i   m    mJ  a 

3      4 
where  M  and  M  are  skewness  and  kurtosis  of  the  market  rate  of  return. 
m     m 

Thus,  if  equation  (13)  is  used  to  estimate  risk  return  relationship, 
the  value  of  systematic  risk  and  skewness  will  be  affected  by  the  third 
and  fourth  moments,  unless  the  value  of  c.  is  equal  to  zero.  When 
this  occurs,  the  linear  characteristic  line  holds  for  the  ith  security, 
and  the  three  moment  capital  asset  pricing  model  reduces  to  the  tradi- 
tional two  moment  model. 

Note  that  equation  (13)  is  just  one  alternative  to  estimate 
the  market  model.   The  quadratic  characteristic  lines  can  be 
formulated  in  a  form  which  is  consistent  with  the  generalized  func- 
tional specification  of  CAPM  as  indicated  in  equation  (10) .   This  type 
of  characteristic  lines  can  be  written  as 

(16)  Rit  -  Rft  =  c*Qi   +  ^(V-R^)  +  c*2i(Rmt-Rft)2  +  x. 
Again  the  relationships  between  3,  Y,  c.  ,  and  c?  are 

(17)  B.  =  c*  +  c*  [mV  +  2(R  -R  )] 

x    lx    zx  m  m     m  r 


(18)      y,  =  c*  +  c*  [(M3-a4)/M3  +  2(R -R  )] 
l    lx    Zi   m  m   m      m  r 
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Iquation  (16)  provides  a  linkage  between  three  moment  capital  asset 
pricing  model  and  the  generalized  functional  transformation.   The  only 
difference  between  equation  (16)  and  equation  (10)  is  that  the  former 
specifies  the  risk  and  discrete  rate  of  return  tradeoff  in  a  finite 
horizon,  while  the  latter  represents  the  relationship  between  risk 
and  continuous  compounding  rate  of  return.   Therefore,  equation  (16) 
not  only  is  consistent  with  the  three  moment  capital  asset  pricing 
model  but  also  implicitly  eliminates  the  statistical  problems  by  nor- 
malizing the  disturbance  term  to  satisfy  the  classical  assumptions  for 
least  squares. 

Table  4  summarizes  the  regression  results  for  equations  (10),  (13), 
and  (16).   c  and  c~  correspond  to  B.  and  y.    in  equation  (10).   The 
average  systematic  risk  estimated  from  the  linear  characteristic  line 
is  also  listed  for  comparison.   The  results  of  three  quadratic  character- 
istic lines  appear  similar.   The  number  of  securities  with  t  values  greater 
than  one  for  the  coefficients  of  the  quadratic  term  for  each  regression 
model  is  reported  in  the  last  column  of  Table  3.   For  those  securities 
with  the  coefficients  of  the  quadratic  term  significantly  different 
from  zero,  the  nonlinear  characteristic  line  holds  and  the  three  moment 
capital  asset  pricing  model  becomes  relevant. 

V.   Estimation  Error  and  the  Performance  of  Mean-Absolute-Deviation 
Versus  Least  Squares  Estimators 

Characteristic  lines  are  often  fitted  to  historical  returns  data 

by  least  squares.   For  a  certain  class  of  distributions  with  fat  tails, 

the  Gauss-Markov  theorem  no  longer  applies  to  least  squares.   As  a 

result  of  giving  more  weights  to  outliers,  least  squares  (LS)  becomes 
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Table  4 


Average  Values  of  Coefficients  and  R" 


Period  1 

-1 
c 

-2 
c 

I2 

n* 

Equation  10 

1.0246 

.9535 

.25 

117 

Equation  13 

1.0550 

1.1590 

.25 

127 

Equation  16 

1.0438 

1.2728 

.25 

128 

Linear  model 

1.0067 

Period  2 

Equation  10 

1.1008 

2.1692 

.23 

166 

Equation  13 

1.1018 

2.653 

.29 

162 

Equation  16 

1.0962 

2.700 

.29 

160 

Linear  model 

1.1036 

Period  3 

Equation  10 

1.0460 

.3204 

.33 

179 

Equation  13 

1.0463 

.2776 

.33 

190 

Equation  16 

1.0413 

.2604 

.33 

188 

Linear  model 

1.0607 

*n  represents  the  number  of  securities  which  have  t  values  greater  than 
one  for  the  coefficients  of  the  quadratic  term. 
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extremely  sample  dependent.  An  alternative  regression  model  less  sen- 
sitive to  outliers  becomes  more  desirable.   Because  of  its  simplicity, 
mean-absolute-deviation  (MAD)  estimator  has  often  been  selected  as  a 
good  candidate,  at  least  for  the  preliminary  estimates.   By  giving 
relatively  small  weights  to  outliers,  MAD  regression  may  provide  a 
more  efficient  estimator. 

To  state  the  above  argument  more  formally,  let 

(20)        !Jt-»Jt  +  v 

where 

AM 

8   =  MAD  beta  estimate  for  security  j  in  period  t. 

*L 

8.   =  LS  beta  estimate  for  security  j  in  period  t. 
Jt  j    j  t- 

B.   =  the  assumed  true  value  of  beta  risk  for  security  i. 
Jt 

When  the  distribution  of  security  returns  exhibits  high  kurtosis,  as 
evidenced  by  our  test,  it  is  expected  that  the  dispersion  of  MAD  esti- 
mates to  be  smaller  than  that  of  LS  estimates.  That  is,  assuming  that 
6.   and  disturbance  terms  are  uncorrelated. 

(21)      Var(S^t)  =  Var(2jt)  +  a1 


(22)      Var(3L J  =  Var(S . J  +  a2 . „ 
j  t       j  t    nj  t 

and      Var(2M  )  <  Var(SL  ) . 
Jt        Jt 
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This  implies  that  a    .      -  a  .  .   If  the  estimated  beta  risk  in  the 

£jt    njt 

previous  period  is  used  to  predict  the  beta  risk  in  the  current  period, 
the  forecasting  model  can  be  written  as 


(23)  Bjt  =a+bSjt_1+u 


where  3.   and  6.   ,  are  either  MAD  or  LS  estimates.   Again  the  tradi- 
jt      jt-1 

tional  Mincer  and  Zarnowitz  method  can  be  used  to  analyze  the  fore- 
casting errors.   However,  the  independent  and  dependent  variables  are 
now  measured  with  error.   This  proxy  error  will  lead  to  the  under- 
estimation of  b  coefficient.  To  see  this  more  clearly,  let 

cm,        ^.'"(VVx) 

(25)      bL  =  1*     ^   X 

where  b  and  b   represent  the  slope  coefficients  of  equation  (23)  for 
MAD  and  LS  regression  estimates  respectively.   Substitute  equations 
(19),  (20),  (21)  and  (22)  into  equation  (24)  and  (25)  and  assume  that 
the  covariances  for  cross-period  disturbance  terms  are  zero  (no  serial 
correlation) , 


Cov(B   ,B    ) 
(26)      bM  =  it_lt-l 


(27)      b"  = 


Var(S.t.1)+a2.t_i   1  +  ^ 


Cov(Sit:'3it-l) 


^it-l^t-1   l+   q^ 


Var(3.t.1) 
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1  i  V  T 

5ir.ce  J  ,    <  a    .      ,    and  both  are  positive,    b   >  b     >  b     where  b 

ejt-1         njt-1 

represents    the   estimated   coefficient  of    equation   (23)   when   there  is 
no   proxy   error. 

The  value  of   the   coefficient  of   determination  of   equation   (24)    is 
also  affected.      Since   there  is  only  one  explanatory  variable,    the  value 
cf   the  squared  simple  correlation  is   equal    to    that  of   the   coefficient 
of   determination. 

Cov(8       8       J2 

(28)  R"  =  2--   ,jt  X 

a*     aA 
Bjt  Bit-1 

For  simplicitv,    assume   that   the  variances   of   6.      and  disturbance 

jt 

terms  are  constant  over  time.  Denote  IL.  and  IL  as  coefficients  of 
determination  for  MAD  and  LS  estimates  respectively.   Then 


C0v(3it'2it-1)2 
(29)  R"   =  1= — i=-J= — 

v   Sjt     ejt/ 


2       Cov(Bit'Bit-l)2 

^    *%  -  ■  2  y\2 

Bit     nit' 


2  9 

which  implies    that  R^  >   RT.      Rewrite   the  Mincer-Zarnowitz   MSE   decompo- 
sition as 

->2  n2 

MSE   =    (3"c-6"c_1) 2  +   (l-b)^t_1  +   (1-R2)^t 

Thus  it  is  expected  that  both  inefficiency  and  random  components  to  be 
smaller  for  the  MAD  estimates. 
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Two  methods  are  often  used  to  estimate  the  MAD  regression  co- 
efficients.  The  first  one  is  the  iternative  procedure  proposed  by 
Karst  (1958)  and  Sharpe  (1971).   Sharpe's  method  is  based  on  a  normal 
equation  expressing  the  sum  of  absolute  deviation  as  a  piecewise  con- 
vex function  of  the  beta  estimate.   Although  Sharpe  designed  an  algo- 
rithm to  solve  problems  in  only  two  dimensions,  his  method  can  be  ex- 
tended to  multi-dimensional  problems.   However,  as  the  number  of  in- 
dependent variable  increases,  the  computing  procedures  become  cumber- 
some. 

The  second  method  is  the  linear  programming  model  suggested  by 
Charnes,  Cooper  and  Ferguson  (1955),  and  Wagner  (1959).   Linear  pro- 
gramming is  especially  suitable  for  regression  with  more  than  one 
independent  variable.   A  primal  algorithm  proposed  by  Barrodale  and 
Roberts  is  adopted  here.   A  feature  of  this  algorithm  is  its  ability 
to  pass  through  several  simplex  vertices  at  each  iteration.   Our  exper- 
ience shows  this  algorithm  is  more  efficient  than  the  generally  ac- 
cepted dual  algorithm. 

In  order  to  examine  the  performance  of  MAD  relative  to  LS  estimates 
more  closely,  data  are  once  again  separated  into  two  groups,  that  is, 
returns  distribution  with  and  without  significant  high  kurtosis.   It 
is  well  known  that  fat  tail  is  associated  with  high  kurtosis.   This 
type  of  grouping  allow  us  to  evaluate  the  performance  of  MAD  estimates 
more  easily.   The  performance  of  these  two  estimators  are  measured  ac- 
cording to  their  forecasting  capability. 

The  empirical  results  for  MAD  estimates  are  reported  in  Table  5. 
When  compared  with  Table  2,  it  is  somewhat  surprising  that  LS  estimates 
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Table   5 
The   Stability  of  Mean-Absolute-Deviation  Beta   Estimate* 


Group 


Slooe 


Mean  Square  Errors 


bias 


inefficiency        random 


total 


(1)    Normality 

1959-65     .23    .516 
1966-72 

1966-72     .31    .473 
1973-79 


.0079032 

.0334055 

.1267178 

.1680265 

5% 

20% 

75% 

.0070728 

.0450288 

.0795368 

.1316385 

5% 

34% 

61% 

(2)  Significant 
kurtosis 


1959-65 
1966-72 

1966-72 
1973-79 


22 


25 


,550 


.490 


.0265136 

.0309154 

.1613588 

.2187879 

12% 

14% 

74% 

.0000435 

.0582932 

.1599715 

.2183082 

0% 

27% 

73% 

(3)    All   companies 

1959-65  .22 

1966-72 


,524 


.012528 
7% 


.033164 
18% 


.138047 
75% 


,183739 


1966-72 
1973-79 


.28 


.478 


,003696 
2% 


.048597 
32% 


.101566 
66% 


.153859 


*The  estimated  regression  model   is 


Beta     =  bQ   +  b.    Beta 
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perform  better  than  MAD  estimates.  While  the  general  patterns  of 
MSE  are  similar,  in  most  cases,  LS  estimates  provide  smaller  fore- 
casting error  for  each  MSE  component:   bias,  inefficiency  or  random. 
The  R  squares  are  also  higher  and  the  b  coefficients  of  equation  (23) 
are  larger  for  each  group  indicating  that  LS  estimates  are  more  effi- 
cient than  MAD  estimates. 

To  examine  the  relation  between  MAD  and  LS  estimates,  we 
regress  the  former  against  the  latter.   This  type  of  regression  pro- 
vides a  scheme  to  see  how  close  these  two  estimates  are.   As  shown  in 
Table  6,  for  the  first  group  the  difference  between  these  two  estimates 
is  mostly  random.   However,  for  the  second  group,  the  bias  portion  of 
MSE  increases  substantially.   This  phenomenon  may  reflect  the  fact 
that  LS  and  MAD  are  mean  and  median  regression  estimators  respectively, 
and  in  the  presence  of  high  kurtosis  the  difference  between  both 
estimates  becomes  more  significant.   To  provide  further  insights  on 
the  performance  of  these  two  estimators,  the  cross-sectional  risk- 
return  tradeoff  and  the  relationship  between  performance  measure  and 
risk  were  also  examined.   Again,  in  every  case,  LS  estimates  provided 
better  or  at  least  no  worse  results.   Our  findings  therefore  question 
the  superiority  of  MAD  estimator  in  estimating  the  market  model  for 
the  monthly  returns. 

There  are  at  least  several  reasons  to  explain  why  MAD  estimator 
did  not  outperform  LS  estimator.   First,  the  existence  of  misspecif ica- 
tion  is  likely  to  affect  the  regression  results.   As  shown  in  Table  4, 
for  a  number  of  securities  the  coefficient  representing  skewness  of 
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Tafale  6 
Mean-Absolute- Deviation  versus   Least  Squares    Beta  Estimates* 


Group 


Slope 


Me  an  Square  Errors 


bias 

inefficiency 

random 

total 

(1) 

No  rmal  i  ty 

1959-65 

.37 

1.006 

.0000588 

0% 

.0000047 
0% 

.0175168 
100% 

.0175804 

1966-72 

.88 

.968 

.0000122 
0% 

.0001479 
1% 

.0193189 
99% 

.0194791 

1973-79 

.33 

1.036 

.0020884 
9% 

.0001211 
1% 

.0200317 
90% 

.0222412 

(2) 

Significant 
kurtosis 

1959-65 

.79 

.937 

.0081234 
21% 

.0005458 
1% 

.0306277 
78% 

.0392970 

1966-72 

.84 

.916 

.0066912 
15% 

.0015721 
4% 

.0349996 
81% 

.0432630 

1973-79 

.79 

.957 

.0066912 
17% 

.0002352 
1% 

.0314874 
82% 

.0384139 

(3) 

All   companies 

1959-65 

.84 

.977 

.001176 
5% 

.000064 
0% 

.023185 
95% 

.024425 

1966-72 

.86 

.944 

.000338 
2% 

.000533 
2% 

.024709 
96% 

.025580 

1973-79 

.78 

.964 

.001317 
4% 

.000151 
0% 

.031170 

96% 

.032638 

*The  estimated  regression  model  is 


Beta  (MAD)  =  a,,  +  a,  Beta  (LS) 
t         0    1     t 
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the  returns  distribution  is  significant.   Second,  when  the  distribution 
of  stock  returns  is  skew,  the  linear  model  no  longer  represents  the 
appropriate  functional  form.   This  being  the  case,  it  is  difficult  to 
expect  the  MAD  estimator  to  improve  the  empirical  estimation.   Third, 
the  error  in  variable  problem  resulted  from  market  index  proxy  may 
considerably  limit  the  usefulness  of  the  MAD  estimator.   Finally, 
Zellner  (1976)  shows  that  for  the  multivariate  t  distribution,  LS 
estimator  is  the  maximum  likelihood  estimator.   Blattarg  and  Gonedes 
(1974)  have  provided  evidence  that  the  t  distribution  fits  the  stock 
returns  distribution  very  well.   The  sample  variances  of  security  re- 
turns examined  in  the  study  are  all  well  behaved  and  have  finite  value. 
Our  results  tend  to  reject  the  hypothesis  of  stable  class  distribution. 

To  further  examine  the  performance  of  the  MAD  estimator,  a  group 
was  formed  to  consist  the  data  with  high  kurtosis  and  no  skewness. 
Again,  the  LS  estimator  outperforms  the  MAD  estimator  in  every 
aspect.   Therefore,  this  study  provides  further  evidence  that  even  in 
the  existence  of  kurtosis,  the  MAD  estimator  is  not  necessarily 
efficient.   In  their  Monte  Carlo  study,  Rupport  and  Carroll  (1980) 
also  indicate  that  MAD  estimator  will  outperform  LS  estimator  only 
under  the  restriction  that  kurtosis  is  really  high. 

V.   Summary 

In  accordance  with  the  modern  statistical  theory  and  econometric 
methods,  this  paper  investigates  the  impacts  of  skewness  and  kurtosis  on 
risk  estimation  and  determination.   First,  impacts  of  kurtosis  on  the 
total  risk  estimation  are  analytically  and  empirically  studied.   Second, 
the  relationships  between  stability  of  beta  coefficients  and  the  third 
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and  fourth  moments  of  security  race  ~f  return  are  analyzed  in  detail.   Third, 
the  impacts  of  skewness  and  kurtosis  on  risk  decomposition  are  determined. 
Finally,  the  concepts  and  measurement  of  estimation  errors  are  used  to  de- 
termine whether  the  0L3  or  the  MAD  method  should  be  used  to  estimate  beta 
coefficients.   In  sum,  this  paper  has  demonstrated  that  both  skewness  and 
kurtosis  of  security  rates  of  return  should  be  concerned  in  estimating 
and  determining  the  risk  in  capital  asset  pricing. 
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Footnotes 

Scott  and  Horvath  (1980)  shows  that  investor  exhibiting  consis- 
tent risk,  aversion,  strict  consistency  of  moment  perference  and  positive 
perfarencs  for  positive  skewness  will  have  negative  perference  for  kurtosis, 

7 

"Our  model  differs  frcm  that  of  Friend  and  Westerfield  (1980,81). 

The  problem  of  market  index  is  not  the  major  concern  of  this  study,  while 

Friend  and  Westerfield  did  not  consider  the  effect  of  kurtosis. 

3 
The  N  used  to  indicate  the  observe  horizon  is  omitted  to 

simplify  the  notation. 

4 
Cootner  (1962)  and  others  show  that  the  degree  of  skewness  and 

kurtosis  is  a  function  of  the  length  of  observated  horizon. 
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